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ORIGINAL ARTICLE 

Transcriptome organization for chronic alcohol abuse in 
human brain 

SP Farris\ D Arasappan^ S Hunicke-Smith^ RA Harris^ and RD Mayfield^ 

Alcohol dependence is a heterogeneous psychiatric disorder characterized by high genetic heritability and neuroadaptations 
occurring from repeated drug exposure. Through an integrated systems approach we observed consistent differences in 
transcriptome organization within postmortem human brain tissue associated with the lifetime consumption of alcohol. Molecular 
networks, determined using high-throughput RNA sequencing, for drinking behavior were dominated by neurophysiological 
targets and signaling mechanisms of alcohol. The systematic structure of gene sets demonstrates a novel alliance of multiple ion 
channels, and related processes, underlying lifetime alcohol consumption. Coordinate expression of these transcripts was enriched 
for genome-wide association signals in alcohol dependence and a meta-analysis of alcohol self-administration in mice. Further 
dissection of genes within alcohol consumption networks revealed the potential interaction of alternatively spliced transcripts. For 
example, expression of a human-specific isoform of the voltage-gated sodium channel subunit 5CN4B was significantly correlated 
to lifetime alcohol consumption. Overall, our work demonstrates novel convergent evidence for biological networks related to 
excessive alcohol consumption, which may prove fundamentally important in the development of pharmacotherapies for alcohol 
dependence. 
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INTRODUCTION 

Dysregulation of neurocircuitry and brain structure through 
altered neurotransmitter systems, endogenous peptides, hormone 
systems and neuroimmune processes has been implicated for 
alcohol and other drugs of abuse.^"^ These numerous biological 
events call attention to the need for large-scale analyses to 
uncover the diverse pathways and mechanisms that are most 
prominent in addiction. Responses to repeated environmental and 
chemical stimuli cause enduring central nervous system adapta- 
tions through coordinate actions of gene expression.^ Identifying 
end-stage central nervous system expression differences asso- 
ciated with human alcohol consumption will be instrumental in 
translating fundamental laboratory findings into human health. 

Human postmortem brain tissue is a valuable resource for 
determining the role of gene expression in neuropsychiatric 
diseases, and establishing a connection with experimental models.^ 
Whole-genome profiling, primarily using microarrays, has identi- 
fied subtle alterations in gene expression within discrete brain 
regions related to alcohol dependence.^'^ High-throughput seq- 
uencing technologies (RNA-Seq) offer novel insights into trans- 
criptome complexity, including non-coding RNAs (ncRNAs) and 
alternatively spliced variants, across different tissues and disease 
states.^ RNA-Seq analysis of postmortem human hippocampus has 
identified differentially expressed genes, potentially related to 
neuronal adaptations underlying long-term cocaine and alcohol 
abuse.^ These changes within human hippocampus are consistent 
with differential GABAergic gene expression in a rodent model of 
alcohol dependence.^° In addition, RNA-Seq has shown that 



alcohol dependence may modify alternative splicing of candidate 
genes, such as GABBRl, within the human brain." Continued 
examination of discrete brain regions through RNA-Seq will 
provide additional evidence of the transcriptional landscape for 
alcohol dependence and other psychiatric diseases. 

Coordinate expression of genes form highly organized biologi- 
cal networks within specific regions of human brain, which are 
reproducible across individuals.^^'^^ The homeostatic structure of 
molecular networks may be disrupted through genetic and 
environmental perturbations producing long-lasting interactions 
involved in complex traits.^"^ Integrating multiple components and 
layers of information within the framework of a biological network 
helps identify and prioritize individual transcripts, as well as sets of 
co-regulated transcripts associated with disease.^ ^ Owing to the 
long-standing cellular adaptations caused by substance depen- 
dence,^^'^'' we hypothesize that alcohol dependence is shaped, at 
least in part, by persistent differences in coexpression systems 
collectively affecting the propensity for compulsive alcohol con- 
sumption. Through massive parallel sequencing of RNA transcripts 
from postmortem human prefrontal cortex (PFC), a prominent 
brain area involved in addiction,^^'^^ our bioinformatics approach 
used transcriptional networks and genetic variation in conjunction 
with estimates of lifetime alcohol consumption to determine 
groups of coexpressed molecular elements associated with alcohol 
dependence. Our analysis provides convergent systems-level 
evidence of genetic networks within PFC that contribute to the 
pathophysiology of alcohol-drinking behavior in humans. 
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MATERIALS AND METHODS 

Case selection and postmortem tissue collection 
Human autopsy brain samples were obtained from the New South Wales 
Tissue Resource Centre at the University of Sydney. Fresh frozen samples of 
the superior frontal gyrus (Brodmann area 8; further referred to as PFC in 
this manuscript) were collected from each postmortem sample. Brain 
tissue was sectioned at 3-mm intervals in the coronal plane. Alcohol- 
dependent diagnoses were confirmed by physician interviews, review of 
hospital medical records, questionnaires to next-of-kin, and from patho- 
logy, radiology and neuropsychology reports. Tissue samples were 
matched as closely as possible according to age, sex, postmortem interval, 
pH of tissue, disease classification and cause of death. To be included as 
part of the alcohol-dependent cohort, subjects had to meet the following 
criteria: >18 years of age; no head injury at the time of death; lack of 
developmental disorder; no recent cerebral stroke; no history of other 
psychiatric or neurological disorders; suitable prolonged agonal state; no 
history of intraveneous or polydrug abuse; negative screen for AIDS 
(acquired immune deficiency syndrome) and hepatitis B/C; postmortem 
interval within 48 h; and diagnosis of alcohol abuse disorder meets the 
DSM-IV (Diagnostic and Statistical Manual of Mental Disorders, Fourth 
Edition) criteria. 

Sample preparation and read counting 

Total RNA from postmortem human PFC was extracted from frozen tissue, 
excluding any samples with contaminated or degraded quality (RNA 
integrity numbers < 5.0). Ribosomal RNA was depleted using RiboMinus 
Eukaryote kit (Life Technologies, Foster City, CA, USA) for RNA-Seq and 
confirmed using an Agilent Bioanalyzer (Agilent Technologies, Santa Clara, 
CA, USA). Samples were processed using ABI whole transcriptome library 
preparation kit and sequenced on the ABI SOLID 4 system (Life 
Technologies) using paired-end reads (35-H50bp). Collected reads were 
processed by the Texas Advanced Computing Center and mapped for 
sequence reads, allowing two mismatches per 25-bp seed length, against 
the human reference genome (hg19), to select unique alignments with the 
highest reproducible mapping. Read counts were generated using the 
Partek Genomics Suite software (Partek, St Louis, MO, USA) (a minimum of 
five reads/alignment was used to determine values). Abundance was 
calculated for full-length gene isoforms with an expectation-maximization 
algorithm.^° Detection of differential expression based on the negative 
binomial distribution for modeled read counts, and normalization using a 
regularized log2 transformation,^^ was conducted within the R project for 
statistical computing. Expression data encompassed all sequencing reads 
that were unambiguously mapped to a single gene within an individual 
sample. 

Bioinformatics analyses 

Scale-free coexpression networks were constructed using the weighted 
gene coexpression network analysis (WGCNA) package in R.^^ WGCNA pro- 
vides a global perspective, emphasizing the interconnectedness between 
genes to classify different molecular groupings, rather than focusing on 
individual genes. Alterations in the molecular networks are due to 
environmental and genetic variation affecting mechanisms of regulation. 
The molecular groups in WGCNA are defined as modules formed by 
densely interconnected genes, and alternatively spliced isoforms, which 
were discerned using a dynamic tree-cutting algorithm based on 
hierarchical clustering (minimum module size =100, cutting height = 0.99 
and deepSplit = TRUE).^^ Corresponding transcripts were assigned to 
unique numeric and color identifiers based on the level of read summari- 
zation, emphasizing the strongest pairwise relationships in expression 
across samples. Module preservation, and reproducibility, for alcoholic and 
matched control subjects was evaluated according to a Z-summary 
statistic with permutation.^^ Assigned modules were functionally anno- 
tated against known ontological categories, and additional features such 
as predicted drug interactions, using standard biological enrichment 
tools.^^'^^ Representative gene ontology plots were visualized using 
semantic dimension scatter plots to reduce redundancy in terms of the 
identified biological groups. The coordinates of x- and y-axes are assigned 
using multidimensional scaling of similarities in ontological terms among 
identified groups. Overall results for functional enrichment of gene 
modules (GMs) related to lifetime consumption can be found in Supple- 
mentary Table S2. Cell-type over-representation analysis was restricted to 
genes having ^10-fold enrichment in neurons, astrocytes or oligoden- 
drocytes,^^ permitting a rigorous assessment of gene clusters within the 



three main central nervous system cell types. Unless indicated otherwise, 
statistical significance of intersecting gene sets was evaluated using the 
Fisher's exact test. Visualizations were rendered using Cytoscape for 
biomolecular connections^^ and available R packages. 

Representative GMs were examined for group-wise over-representation 
of genetic variants within the database of genotypes and phenotypes 
(dbGaP)^^ for the public results of two alcohol-dependent studies and 
three non-psychiatric diseases as negative controls: Collaborative Study on 
the Genetics of Alcoholism (COGA; phs000125.v1.p1); Study of Addiction: 
Genetics and Environment (SAGE; phs000092.v1 .pi); National Eye Institute 
Age-Related Eye Disease Study for age-related macular degeneration and 
cataracts (phs000001.v3.p1); NIDDK IBD Genetics Consortium Crohn's 
Disease (phs000130.v1.p1); and Type 1 Diabetes Genetics Consortium 
(phs000180.v2.p2). Individual datasets were extracted from dbGaP, filtered 
for genetic variants with P-values :^0.01, and annotated for associated 
gene identifiers.^^ 



RESULTS 

RNA-Seq assessment of gene coexpression networks in PFC 
Paired-end sequencing (50/35 bp) fragments from ribosome- 
depleted RNA, spanning an average of -35 million alignments 
per sample, were collected from the PFC of 16 alcoholics and 15 
matched controls. Following filtering of raw reads for sequencing 
quality, RNA expression was detected using the RefSeq database^^ 
for 17 608 gene models, 29 607 alternatively spliced transcript 
models and 196 453 exons. Alcohol-dependent individuals varied 
in the duration and lifetime amount of alcohol consumed (Supple- 
mentary Figure 51); ranging from 20 857 to 325 893 standard 
drinks throughout the lifetime duration of alcohol-dependent 
consumption. Alcohol-dependent subjects did not significantly 
differ from control subjects in age, postmortem interval, brain pH 
level, brain weight, RNA quality (RIN) or number of sequence 
alignments (Supplementary Figure S2). 

Gene-level expression data for controls and alcoholics were 
assigned separately into organized clusters sharing highly similar 
expression patters (that is, modules) using a signed WGCNA.^^ The 
modular organization of coexpression systems is suggested as 
an important underlying aspect of cellular function and in the 
pathogenesis of disease.^^'^"^ Coexpression analysis was applied 
separately for controls and alcoholics to contrast network struc- 
ture between disease cohorts, and designate those PFC modules 
most related to excessive alcohol consumption within dependent 
individuals. Depicting expression data in the form of modules 
condenses a large number of expression characteristics into 
manageable units in order to clarify and prioritize systems within 
the human brain,^^"^^ as well as define representative expression 
profiles that can be associated with specific traits. Our analysis 
identified 32 and 38 gene expression modules for controls and 
alcoholics. RNA-Seq defined modules were significantly conserved 
in relation to previous network-based studies from human cortex 
(Supplementary Table SI), forming distinctive functional cate- 
gories that demonstrate consistency of networks with previous 
human brain studies. Alcoholic modules were further over- 
represented for a similar network analysis on alcohol dependence 
using microarray data (Supplementary Figure S3);^ however, more 
modules were discerned in our current investigation, which may 
be attributed to the precision, larger dynamic range and improved 
network characteristics involving RNA-Seq.^^ The normalized read 
counts could also be further summarized for transcript-level and 
exon-level information (Figure 1), providing an overall perspective 
of the transcriptome network structure. Neither controls or 
alcoholics showed any substantial clustering bias for brain weight, 
pH, postmortem interval, RIN, age or smoking (Supplementary 
Figure S4). To determine global differences in transcriptome 
architecture, we compared mean ranked expression and con- 
nectivity patterns between controls and alcoholics. Gene, tran- 
script and exon level expression was strongly correlated between 
controls and alcoholics (Figure la). In contrast the global 
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Figure 1. Global comparison of prefrontal cortex (PFC) transcriptonne architecture in alcoholic and nnatched controls, (a) Scatter plots of 
ranked expression (left) and connectivity (right) in alcoholics (n = 16) and nnatched controls {n = 15). The scatter plots dennonstrate that overall 
transcriptonne features are robustly correlated for overall expression for alcoholics (y-axis) and controls (x-axis), regardless of expression for 
genes, alternatively spliced transcripts or individual exons; however, disease is related to increasingly disparate patterns in overall connectivity 
across genes, alternatively spliced transcripts and individual exons. (b) Average connectivity of coding features is significantly reduced in 
alcoholic PFC connpared to controls, showing dinninished honneostatic interconnection annong genes (left), transcripts (nniddle) and exons 
(right). Bar plots show the nnean±s.e., with asterisks (^^^) denoting a significant difference between controls and alcoholics at P< 1e-16. 
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connectivity, a measure of inter-relationship among all the 
features within a biological network, demonstrated progressively 
weaker correlation (Figure 1a). The discrepancy between strong 
overall expression and weaker connectivity suggests the neuro- 
biology of alcohol dependence in PFC is due to altered co- 
variation of transcribed molecular features, rather than discrete 
differential expression across the transcriptome. Average global 
connectivity per feature was significantly reduced in alcoholics 
(Figure 1b), reflecting consistent homeostatic dysregulation for 
biological processes within alcohol-dependent PFC. 

In order to further refine GMs in the context of alcohol 
dependence, GM network structure was correlated with pheno- 
typic variation for lifetime consumption of alcohol across 
individual subjects. Establishing correspondence with a particular 
phenotypic trait (that is, alcohol consumption), rather than a 



clinical diagnosis alone, is biologically relevant to substance abuse 
and experimentally tractable in future studies. Correlation with 
lifetime alcohol consumption ranged from |r = 0.01 to r = 0.73|, 
with 10 individual GMs being in the upper and lower quartiles of 
the distribution (Figure 2a). The first principal component of GM 
expression within the upper quartile on average accounted for 
1 8.9% of the variance for lifetime consumption versus 1 .5% for the 
lower quartile, a 12.6-fold difference between the GM groups. 
Intermodular correlation is significantly higher among the upper 
quartile GMs associated with alcohol drinking compared to 
GMs residing in the lower quartile (f(88)=4.56, P=1.65e-05; 
Figure 2b). This correlation structure implies coherent biological 
function for a set of modules, comprising 2330 genes in PFC, 
directly related to alcohol-drinking behavior in humans. GMs 
within the upper quartile form two distinct groups (group 1: 
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Figure 2. Analysis of gene coexpression modules for estimates of lifetime alcohol consumption, (a) Ranking of alcoholic gene coexpression 
networks in relation to a specific phenotypic trait, the degree of lifetime alcohol drinking; dividing modules into larger cohorts which are most 
(upper quartile) and least (lower quartile) associated with human alcohol consumption, (b) Ranked intercorrelation of modules in the upper 
quartile (left) and lower quartile (right) of alcohol consumption. Heatmaps demonstrate modules within the lower quartile are largely 
unorganized whereas those in the upper quartile form two distinct factions. Mean absolute correlation among the two categories are 
significantly different f(88) = 4.56, P= 1.65e-05. (c) Average connectivity of matched gene coexpression patterns within the upper and lower 
quartiles for controls (blue) and alcoholics (red), showing the lifetime amount of alcohol consumed influences transcriptome architecture F 
(1,14964) = 1 1 1.5; P< 1.0e-16 leading to higher average connectivity patterns associated with disease (bar plots show the mean±s.e.). (d) 
Mosaic plot of the three major cell types in the mammalian brain (neurons: top, astrocytes: middle and oligodendrocytes: bottom) with 10- 
fold enrichment for the upper (leftj and lower (right) quartiles; size of boxes reflects the number of overlapping genes. The upper quartile 
cohort is significantly enriched for neuronal classified genes (P = 4.66e-23, Fisher's exact test), (e) Scatter plot of relevant over-represented 
gene ontology categories, plotted based on semantic similarity and scaled according to -logio P-values for the upper quartile (complete gene 
ontology results provided in Supplementary Table S2; see Materials and Methods for additional details). 
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GIV131, GM15, GIV125 and GIV120; and group 2: GM19, GMSO, GIV133, 
GIV128, GIV113 and Gl\/137s), whereas GMs within the lower quartile 
share almost no pronounced inter-relationship. Interestingly, there 
was a significant difference for the average connectivity of genes 
residing in the upper versus the lower quartile and disease state 
(Figure 2c). The interconnectedness of genes belonging to GMs 
weakly tied to alcohol consumption (that is, lower quartile) 
followed a similar pattern to global differences in transcriptome 
organization between control and alcoholic PFC (Figure lb), while 
those within GMs linked with alcohol consumption (that is, 
upper quartile) showed the reverse, with higher average gene 
connectivity in alcoholics compared to matched controls 
(F(1,14964) = 111.5, P< 1e-16). This suggests a subtle restructur- 
ing of the transcriptome in alcoholic PFC favoring GMs associated 
with an individual's lifelong pattern of alcohol intake. Owing to the 
unique composition of gene networks associated with lifetime 
consumption of alcohol, gene coexpression in the upper quartile 
is less preserved in controls than in the remaining groups (Supple- 
mentary Figure S5). Such a favorable arrangement of genes in the 
upper quartile could indicate an allostatic mechanism for repeated 
maladaptive alcohol consumption. Upper quartile GMs related to 
alcohol intake are mainly neuronal (Figure 2d) and comprised of 
ontological categories for genes involving synaptic plasticity, ion 
channel function, transmembrane transporters and intracellular 
signaling molecules (Figure 2e; Supplementary Table S2). 

GMs associated with lifetime consumption of alcohol 
A number of isolated model system approaches have implicated 
ion channels and signaling networks in the neurobiology of 
alcohol; however, such studies may overlook small responses or 
fail to capture widespread heterogeneous molecular effects in 
humans."^^ Our results demonstrate a markedly orchestrated 
network of these genes in human PFC related to the degree of 
lifetime alcohol consumption, providing a framework for decipher- 
ing multiple genes in the context of a specific phenotypic trait 
of alcohol dependence. Molecular function for these networks 
spans multiple dimensions, but tie together several relevant 
biological processes of alcohol exposure. Glutamate receptor 
activity (P = 3.86e-05), a known system underlying the PFC 
circuitry of compulsive behavior in addiction and a direct phar- 
macological target of alcohol,"^^ is just one of several functional 
categories enriched within GMs associated with lifetime alcohol 
consumption. 

Group 1 of the upper quartile was the least variable contingent 
of modules, with the greatest inter-module correlation among 
1030 genes (Supplementary Figure S6), promoting the observed 
differences in disease status and excessive alcohol consumption. 
Functional enrichment of elements contained within group 1 
largely accounted for gene ontologies corresponding to lifetime 
alcohol consumption within the upper quartile (Figure 2e). This set 
of transcripts within group 1 are preserved for neuronal protein- 
protein coexpression (P= 1.19e-36),^^ suggesting the GMs may 
extend beyond co-regulated RNA transcripts. In addition, this 
network substructure is further over-represented for a meta- 
analysis of alcohol-drinking behavior in mice (P = 4.59e-08),'^^ 
providing independent evidence for a cohesive group of genes 
involved in alcohol consumption. Despite the fact, our analysis has 
focused on molecular networks for lifetime alcohol consumption, 
a portion of these jointly expressed genes may be indirectly 
regulated by common substrates and closely associated endo- 
phenotypes governing the pathophysiology of alcohol depen- 
dence. MicroRNA miR-9, a post-transcriptional regulator of splice 
variation and neuroadaptations of alcohol tolerance (a hallmark of 
escalated alcohol drinking), targets a notable share of genes 
within group 1 (P = 9.17e-04). Noteworthy targets of miR-9 
include the BK potassium channel KCNMAl and GADl, compo- 
nents of neuronal excitability and alcohol behavioral 
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phenotypes."^^""^^ These results further highlight the utility of the 
network-based approach to concurrently identify multiple disease- 
relevant genes, an important tool given the multifaceted nature of 
alcohol use and other psychiatric disorders. 

Leveraging network properties for the pairwise relationship 
among genes allows for the identification of functionally relevant 
candidates in neuroplasticity and disease.'^^'^^ Although modules 
within group 1 share similar expression patterns, the individual 
gene sets may also contribute to differing states of disease pro- 
gression. We hypothesized that identifying genes within central 
positions (that is, hub genes) of the alcohol-perturbed modules 
would distinguish major, yet underappreciated, participants in the 
neurobiology of human alcohol consumption. GM15, GM20, GM25 
and GM31 were restricted to the top 10% of connected genes, 
highlighting the primary contributors within the network that are 
associated with lifetime alcohol consumption (Figure 3). Among 
the most central elements are glutamatergic and GABAergic 
receptors (GABRA3, GABBRl, GABBR2, GRIAl and GRIN2B), two 
neurotransmitter systems widely recognized to affect alcohol- 
induced behaviors.^^ In addition, a number of candidates within 
these networks represent novel determinants of alcohol depen- 
dence. Hub genes within these molecular networks conceivably 
act across species as 25/104 the core genes coincide with the 
aforementioned meta-analysis of alcohol-drinking behavior in 
mice."^^ Probing specific targets across species may highlight 
unknown pathways in the human brain relevant for alcohol use 
disorders. PAKl is one concurrent example, acting as a regulator of 
dendritic spine formation and remodeling, ^ '^^ and may be 
important in cortical development and neuroadaptations under- 
lying addiction.^^ 

Potential role of SCN4B transcripts in lifetime human alcohol 
consumption 

Concentrating on the top 10% of genes, within group 1 of the 
upper quartile, related to lifetime alcohol consumption identified 
three high-confidence genes (SCN4B, SYT13 and TBC1D9) in 
common between mice and humans (Figure 3). The voltage- 
gated sodium channel is a putative quantitative trait gene for 
alcohol-drinking behavior."^^'^^ Serving as an auxiliary partner to 
sodium channel alpha subunits within neurons, SCN4B can over- 
ride the effects of other beta subunits,^^ modify sodium channel 
firing rates^^ and structurally alter neuronal projections.^^ Currently 
there is only one recognized isoform of SCN4B in rodents, 
however, there are three coding isoforms of SCN4B in humans. 
Only the shortest coding variant (NM_001 142349) is significantly 
correlated to alcohol consumption (Figure 4). WGCNA of alterna- 
tively spliced transcripts closely mirrors pairwise gene expression 
profiles for biological function (Supplementary Table S3 and 
Supplementary Figure S7); NM_001 142349 being an appreciable 
phenotypic focal point within the network that is distinct from the 
other two coding isoforms of SCN4B. Differentially spliced trans- 
cripts expressed alongside NM_001 142349 included NM_014191 
(5CN8A; Supplementary Figure S7), a sodium channel alpha 
subunit expressed within cortical nodes of Ranvier, dendrites 
and neuronal synapses.^^ Positive coexpression of 5CN8A and 
SCN4B isoforms, plus additional transcripts, within alcoholic PFC 
denotes a specific functional component influencing the neuronal 
circuitry of lifetime alcohol consumption. 

Convergent evidence from genome-wide association studies 
Comparing the coexpression networks of alcoholics and non- 
alcoholics, in terms of lifetime consumption, we identified a 
coordinated set of multiple factors underlying synaptic dysregula- 
tion in disease. The identified groups of coexpression modules 
were further evaluated against genome-wide association studies 
for alcohol dependence (COGA and SAGE) to asses whether these 
gene sets were applicable to a larger cohort of afflicted 
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Figure 3. Network diagrams for group 1 gene coexpression modules associated with lifetime alcohol consumption. Visualization 
demonstrates the interconnection of genes for the top 10% of module members with ^lr = 0.801 correlation strength. Relative size of the 
depicted networks is based on the size of the four modules. Size of the nodes reflects the module connectivity ranking for individual genes. 
The degree of node opacity is proportional to the correlation with lifetime consumption, with alcohol hubs defined as those genes within the 
top 10% of alcohol-associated genes in our human sample. Networks were further overlapped using a prior meta-analysis of alcohol-drinking 
behavior in mice^^ to identify potential points of convergent validity in rodent models. 
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Figure 4. Correlation of the quantitative trait gene candidate SCN4B (voltage-gated sodiunn channel type-IV beta subunit) isofornns. Scatter 
plots for three coding variants (Nl\/l_001 142349, Nl\/l_001 142348 and Nl\/l_1 74934) of SCN4B, dennonstrating a significant negative correlation 
for only the shortest coding isofornn (Nl\/l_001 142349) for lifetinne alcohol consunnption in hunnans. 



individuals. Group 1 of the GMs linked to lifetinne consunnption of 
alcohol was the only expression ensemble significantly enriched 
for genes containing single nucleotide polymorphisms associated 
with alcohol dependence (Figure 5), suggesting genes within this 
cohort may have a decisive affect on the development of an 
alcohol use disorder. The coordinated network structure of 
alcohol-drinking behavior for GMs in group 1 is set apart from 
generalized differential expression in disease status, emphasizing 
the network schema of a designated phenotype over non-specific 
changes in gene expression for alcoholism. 

DISCUSSION 

Alcohol dependence, similar to other psychiatric diseases, 
manifests through the interaction of multiple genes and signaling 
pathways. Disrupting key genes within the network may affect a 
broad range of molecular mechanisms and modify downstream 
phenotypes. Through a system-based approach of transcriptome- 
wide expression from human postmortem PFC, our analysis 
highlights comprehensive differences in sustained pairwise 
expression profiles related to an alcohol use disorder. The 
transcriptional landscape of alcoholic PFC formed a coordinate 



biological network associated with the lifetime consumption of 
alcohol. The internal structure of the network intertwined known 
and unknown targets related to the pathophysiology of alcohol- 
drinking behavior, offering a comprehensive assessment for the 
widespread pharmacological actions of repeated alcohol exposure 
in human brain. 

Diverse genetic and environmental effects influence patterns of 
alcohol use in humans and animal models,^°'^^ with 50-60% of the 
risk of alcoholism being a consequence of genetic factors.^^'^^ 
Alcohol consumption measures are reliable and valid indices of 
alcohol usage in human reports.^^'^^ Gene networks affiliated with 
lifetime alcohol consumption were enriched for genetic associa- 
tion signals involving alcohol dependence (Figure 5), providing 
convergent evidence for an aggregate of genes as causal 
determinants in disease progression. Disease- specific sequence 
variant commonalities were modest, but may be due to localiza- 
tion of single nucleotide polymorphisms within non-coding 
regions of DNA.^^ Sequencing of DNA and RNA isolated from 
the same subjects could provide even greater detail of the genetic 
architecture of transcriptional variation and subsequent beha- 
vioral traits. Prior studies on candidate genes, such as AUTS2, have 
demonstrated a potential role for intronic single nucleotide 
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Figure 5. Over-representation for single nucleotide polynnorphisnns (SNPs) fronn genonne-wide association studies (GWAS) related to alcohol 
dependence. Enrichnnent of groups (left to right) upper quartile group 1, upper quartile group 2, lower quartile and differentially expressed 
genes within two GWAS involving alcohol dependence (COGA — Collaborative Studies on Genetics of Alcoholisnn and SAGE — Study of 
Addiction: Genetics and Environnnent) and three non-psychiatric negative control datasets (Neg.1 — age-related nnacular degeneration and 
age-related cataracts, Neg.2 — Crohn's disease and Neg.3 — type 1 diabetes). The -logio P-value is plotted along the y-axis, with dashed 
horizontal lines designating the cut-off level for over-representation at P = 0.05, 0.01 and 0.001 . Only the aggregate of genes within group 1 of 
the upper quartile associated with lifetinne alcohol consunnption is significantly over-represented for SNPs underlying alcohol dependence. 



polymorphisms in alcohol consumption.^^ Allelic variation within 
the intronic single nucleotide polymorphism rs6943555 of AUTS2 
was also associated with altered mRNA expression in human 
PFC.^'' Transcriptome meta-analysis of mice differing in voluntary 
alcohol consumption also showed expression differences of 
Autsi, which is located within a quantitative trait loci for alcohol 
preference between high alcohol preferring (HAP1) and low 
alcohol preferring (LAP1) mice."^^ Furthermore, downregulation of 
an AUTS2 homolog in Drosop/?/7c7-blunted alcohol sensitivity, 
providing cross-species evidence for the role of AUTS2 in 
alcohol-related behavioral traits.^^ AUTS2 is a member of the 
upper quartile GMs described herein for lifetime alcohol con- 
sumption, illustrating a specific gene within the network that has 
been validated in independent studies for alcohol consumption. 

GMs in the upper quartile of lifetime alcohol consumption 
constituted two main groups (Figure 2), with only group 1 being 
enriched for signals from genome-wide association studies of 
alcohol dependence. The lack of genetic association within GMs 
belonging to group 2 may by and large suggest a non-genetic, yet 
biologically related to group 1, set of molecular components 
affecting the neurobiology of alcohol intake. Chronic alcohol con- 
sumption, genetic factors and associated environmental influ- 
ences may bring about coordinately regulated molecular adapta- 
tions supporting brain function in addictive processes. GM19, a 
group 2 module, consists of genes encoding central nervous 
system white-matter proteins (MOBP, OPALIN, UGT8 and ERMN). 
Abnormal white-matter expression in the PFC plays an essential 
role in brain morphology and behavioral responses to alcohol and 
other drugs of abuse.^^"^° Alterations in myelin-related proteins 
are capable of distorting axon-glial interactions and clustering of 
ion channels responsible for neuronal conduction,^^ a prominent 
biological feature of the GMs identified with association to 
alcohol-drinking behavior. Group 2 also contains long ncRNAs 
such as NCRNA00092, NCRNA00174 and NCRNA00284. The func- 
tion of these ncRNAs is currently unknown, but may be important 
in gene regulation and human disease.^^ The network of long 



ncRNAs may indicate a cooperative role in cellular plasticity that 
shapes patterns of human alcohol consumption. 

Alternative splicing of transcripts occurs for -95% of multiexon 
genes in humans and produces substantial functional diversity 
and flexibility in complex phenotypes. RNA-Seq has the ability to 
delineate human-specific gene isoforms relevant to neurobiology 
and disease. Few studies to date have conducted large-scale 
studies on alternative splicing for alcohol consumption. Lack of 
existing studies on alternative splicing for alcohol use disorders 
across populations limits the ability to compare the results herein. 
The shortest coding version of the reported QTG SCN4B is one 
example significantly correlated to lifetime consumption and is 
part of a transcriptional network that may alter neurophysiological 
responses to alcohol. Although further experiments are necessary 
to probe molecular interactions with alcohol and yet unforeseen 
physiology of splice variants, our analysis provides evidence 
within a network-based infrastructure involving multiple adjoining 
isoforms for alcohol dependence. For example, activity of SCN8A 
transcripts, the density of which is maintained by the composition 
of white matter,^^ may be altered by interaction with SCN4B 
isoforms and in turn collectively affect neuronal properties among 
a network of receptor-mediated systems. RNA-Seq studies are 
poised to reveal novel transcriptome dynamics for a variety of 
diseases and phenotypes; however, independent studies from 
other model systems may extend RNA-Seq findings as additional 
tools and resources become available over time. RNA-Seq studies 
are restricted to the materials available for transcriptome profiling. 
Limited access to human brain tissue for alcohol dependence, as 
well as other neuropsychiatric conditions, often precludes investi- 
gation of temporal transcriptome changes in response to repeated 
substance abuse or environmental differences. Characterizing the 
transcriptome of postmortem brain tissue represents an end-stage 
molecular phenotype, the cumulative effects on the transcriptome 
within an individual. Expanding RNA-Seq efforts across a greater 
cross-section of individuals and brain regions may strengthen our 
understanding of molecular networks influencing alcohol-drinking 
behavior, but across this representative sample of alcoholics 
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emphasizes a broader picture of transcriptome differences within 
PFC that are related to disease. Transcriptome signatures defined 
by GMs in group 1 for lifetime alcohol consumption contain 
targets for several pharmaceutical agents that have the potential 
to curb alcohol intake, including the prescription drug gabapentin 
(P = 4.91e-07). A randomized clinical trial for alcohol dependence 
has previously shown the effectiveness of gabapentin on heavy 
drinking and prolonging abstinence/^ Combining clinical and 
transcriptome results demonstrates a practical utility of this and 
related studies to determine potential pharmacotherapies for 
alcohol abuse. Polymorphisms conferring the risk of developing 
alcohol dependence may influence network substructures related 
to alcohol-drinking behavior and dictate logical treatment options. 
The results of our integrative study emphasize the power of high- 
throughput sequencing of human postmortem brain tissue as an 
intermediate resource for uncovering novel molecular mechan- 
isms in human disease and as a means of informing personalized 
medicine for the treatment of alcohol use disorders. 
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